Finding All Allowed Edges in a Bipartite Graph 
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Abstract 



We consider the problem of finding all allowed edges in a bipartite graph G = {V, E), i.e., all 
edges that are included in some maximum matching. We show that given any maximum matching 
in the graph, it is possible to perform this computation in linear time 0{n + m) (where n = \V\ 
^Sj \ and m = \E\). Hence, the time complexity of finding all allowed edges reduces to that of finding a 

single maximum matching, which is 0{'n}''^m) (Hopcroft andKarp HlOV ). or 0{{n/ log n)^'^m)for 
dense graphs with m = 0(n^) (Alt et al. jJiTS). This time complexity improves upon that of the best 
known algorithms for the problem, which is 0{nm) (Costa ^ for bipartite graphs, and Carvalho 
and Cheriyan S^ for general graphs). Other algorithms for solving that problem are randomized 
Y^ • algorithms due to Rabin and Vazirani hlH and Cheriyan ^, the runtime of which is 0(n^'^^^). 

Our algorithm, apart from being deterministic, improves upon that time complexity for bipartite 
graphs when m = 0{n'') and r < 1.876. In addition, our algorithm is elementary, conceptually 
simple, and easy to implement. 
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1 Introduction 



Let Vi be a set of men and V2 be a set of women that are registered in a matchmaking agency. 
Each of the men (women) has an associated description and given preferences regarding the woman 
(man) with whom he (she) wishes to get acquainted. Those descriptions and preferences induce a 
r> ' bipartite graph G of compatibilities on the set of nodes V = Vi U V2. The agency then presents 

j^ ■ to the clients the links that are relevant to them, so that they can choose whom to meet. Some of 

those links may not be extended to a maximum matching in the bipartite graph. The agency has 
an incentive to identify those links upfront and not offer them to its clients, since if they would 
be offered and one of them would be successful, it would prevent the possibility of achieving a 
maximum matching. Therefore, given the full bipartite graph G, it is needed to remove from it all 
edges that are not part of a maximum matching, or, alternatively speaking, it is needed to compute 
the subgraph of G that equals the union of all maximum matchings in G. 

In order to formalize this problem, we recall some of the basic terminology regarding matchings 
in graphs. 
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Definition 1.1. Let G = {V, E) be a graph. 

• A matching in G is a collection Ad Q E of non-adjacent edges. 

• A matching M is called maximal if it is not a proper subset of any other matching. 

• A maximal matching M is called a maximum (cardinality) matching if there does not exist a 
matching with a greater cardinality. 

• A maximum matching M is called perfect if it covers all of the nodes in V. 

• An edge e £ E is an allowed edge if it is included in some maximum matching. 

We study here one of the fundamental problems in matching theory, namely, the identification of 
all allowed edges in a given graph G. We focus here on bipartite graphs, i.e. G = {V, E) where 
y = Fi U V2, Fi n V2 = 0, and E C Vi X V2, and devise an efficient algorithm for that problem in 
such graphs. 

Our main contributions herein ai^e summarized as follows: In case we aie given one of the maxi- 
mum matchings in the bipartite graph G = {V, E), Algorithm [3] in Section|2]finds all allowed edges 
in linear time 0{n-\- m), where n = |y| and m = |£'|. (In the application that motivated this study 
there was indeed one "natural" maximum matching that was known without any computation, see 
Section [5]) Without such prior knowledge on the graph, we may first find any maximum matching 
in the graph, and then proceed to apply Algorithm [3] in order to complete the computation in addi- 
tional linear time. Algorithm [T] implements this approach. Its runtime is dominated by the runtime 
of the procedure for finding a maximum matching. 

Algorithm 1 Finding all allowed edges in a bipartite graph G = {V, E). 
Input: A bipartite graph G = {V,E). 
Output: All allowed edges in E. 

1: Find a maximum matching in G, say M. 

2: Using M, invoke Algorithm [3] in order to find all allowed edges in G. 

The Hopcroft-Karp algorithm lITOl offers the best known worst-case performance for finding a 
maximum matching in a bipartite graph, with a runtime of 0{n^^^m). For dense bipartite graphs, a 
slightly better alternative exists: An algorithm by Alt et al. HI finds a maximum matching in a bipar- 
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tite graph in time O ( n^'^ ( j^p^ ) ) . In cases where m = 6(n^), it becomes 0{{n/ log n) 

whence it is a (log n)^/^-factor faster than the Hopcroft-Karp algorithm. 

A faster algorithm for finding a maximum matching in bipartite graphs was recently proposed by 
Goel et al. lUl for the special case of d-regular graphs (graphs in which all nodes have degree d). 
However, in such graphs all edges are allowed (see more on that in Section iBt. 

The paper is organized as follows. The algorithm for finding all allowed edges from any maxi- 
mum matching in the bipartite graph is given in Section |2l In Section [3] we consider the problem 
in a dynamic setting and discuss an efficient implementation in such settings. In Section |4] we dis- 
cuss related work and compare our algorithm to the leading algorithms. We conclude in Section 
\5\ in which we describe problems from data privacy that motivated this study and suggest future 
research. In the Appendix we comment on the size of matching to which a given set of non-adjacent 
edges can be extended (Section |A|, and prove that in regular bipartite graphs all edges ai^e allowed 
(Section E). 



2 A linear time algorithm for finding all allowed edges from any maximum matching 

We begin by considering in Section IZT] the case of balanced graphs i\Vi\ = | V2I) with a perfect 
matching. In Section |2!2] we consider the general case. 

2.1 Bipartite graphs with a perfect matching 

Let us denote the nodes of the graph as follows: Vi = {vi, . . . , va} and V2 = {^'i, ■ ■ ■ , v'^^} 
(here, h = ^). The graph G is assumed to have at least one known perfect matching; without loss 
of generality, we assume that it is M := {{vi,v'i), . . . , {vn, v^)}. 

Definition 2.1. A set ofi > 1 edges in the graph G is called an alternating cycle (with respect to M) 
if there exist i distinct indices, 1 < i 1 ,...,?£< n, such that the I edges are {vii,v'^ ), {vi^,v'- ),. . ., 
ivk.i,vl),{vi„v'ij. 

Tlieorem 2.2. Let G = {V,E) be a bipartite graph where V = Vi\J V2, V\ = {vi, . . . ,Vn}, 
V2 = {v'l, . . . , v'f^}, and E C Vi x V2. Assume further that M := {{vi,v'i), . . . , {vn,v!^)} C E. 
Then an edge e a E is allowed if and only if it is included in an alternating cycle. 

Proof. Assume that e = (fji,^^ ) is part of an alternating cycle, say {vi^jV^ ), {vi^,v[ ),..., 
{vii,_-^,v[ ), {vi^,v[ ). If we augment this alternating cycle with the fi — i. edges {vi,v[) for all 
z ^ {ii, . . . , i^} we get a perfect matching. Hence, e is allowed. 

Assume next that e = {vi-^,v'^^) is allowed. Then it is included in some perfect matching M' . 
We proceed to define a sequence of edges Se ^ M' in the following manner: ei := e = (u j^ jv'^); 
then, for all j > 1, if Cj = {vi.,v[ _^), we set ej+i := (^ij+n'^i , 3), where z;[ is the node that 
M' matches with Vi,^^. Since M' is finite, the sequence must repeat itself at some point. Namely, 
there must exist a minimal index jo > 2 such that e^Q equals one of the previous edges in Se. It is 
easy to see that ej^^ must coincide with ei since if e^p would coincide with, say 62, then M' would 
have included two different edges that are incident to Vi^. But then the sequence Se up to that point, 
Se = {ei = e, 62, . . . , Cjo-i} is an alternating cycle with respect to M. Hence, every allowed edge 
must be contained in an alternating cycle. D 

Next, we define the directed graph H = (U, F) that is induced by the bipartite graph G = {V, E). 
In the directed graph H = [U, F) the set of nodes is U = {ui, . . . , ua} and [ui, Uj) is a directed 
edge in that graph if and only if i 7^ j and (uj, w' ) € E. See example of a bipartite graph G and the 
corresponding directed graph H in Figure [U 

It is easy to see that, in view of Theorem 12.21 an edge {vi , u' ) G i? is allowed in G if and 
only if i = j or the edge (u, ,Uj) S F is part of a directed cycle in H. For example, the 
edge (f3, v'^) is allowed in G that is given in Figure [T] since it is part of the perfect matching 
{{vi,v[), {v2,v'^), (^3,^4), (^4,^^2)1 (i'^ which the last three edges are an alternating cycle), and 
indeed the corresponding edge (U3, U4) is part of a directed cycle of length 3 in H. 

Therefore, the problem of finding all allowed edges in G reduces to the problem of finding all 
edges in the directed graph H that are part of a directed cycle. This may be achieved as follows: 
First, one has to find all strongly connected components of H; namely, all maximal strongly con- 
nected subgraphs of H (recall that a directed graph is strongly connected if there is a path from 
each node in the graph to every other node). If each strongly connected component is contracted 
to a single node, the resulting graph is a directed acyclic graph. Consequently, a given edge in 
H is part of a directed cycle if and only if it connects two nodes in the same strongly connected 
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Figure 1. A bipartite graph G and the corresponding directed graph H 



component. There are several efficient algorithms for finding the strongly connected components of 
a given directed graph. Tarjan's algorithm il21 and Cheriyan-Mehlhorn-Gabow algorithm ^ are 
both equally efficient with a linear runtime. Finally, we use those findings to mark all allowed edges 
in the original bipartite graph G. Algorithm |2] does all of the above. Its runtime is 0{n + m). 

Algorithm 2 Finding all allowed edges in a bipartite graph, given a perfect matching. 



Input: A bipartite graph G = (V, E) where V 
£' C 14 X V2, and for all 1 < i < n, {vi^v'^^ 
Output: All allowed edges in E. 
1 

2: 
3: 
4: 
5: 
6 
7: 



= Vi U V2, Vr 

£E. 



{vi,...,Vn},V2 = {v[,...,v'f^}, 



Mark all edges in E as not allowed. 

Construct the directed graph H = {U, F) that con^esponds to G. 

Find all strongly connected components of H. 

for all edges {ui,Uj) £ F do 

if Ui and Uj belong to the same strongly connected component in H then 
Mark the edge {vt, v',) £ E as allowed. 

end if 
end for 
Mark all edges {vi,v'j) G E as allowed, 1 <i <n. 



2.2 General bipartite graphs 



After dealing with balanced bipartite graphs that have a perfect matching, we turn our attention 
to the general case. Hereinafter we let G = (F, E) be a bipartite graph where the two parts of the 
graph are Vi = {vi, . . . , t;„^} and V2 = {^ii ■ ■ ■ , ^'^2^' ^1 — ^^2. and the maximum matchings are 
of size t < rii. We may assume that t < n2 since if t = 722, G is a balanced bipartite graph with a 
perfect matching, and we already solved the problem for such graphs. Let M be a given maximum 
matching in G. We may assume, without loss of generality, that M := {{vi,v[), . . . ,{vt,v^)}. 

Definition 2.3. A node vi or v[ is called an M -upper node ifi<t and an M -lower node ifi>t. 
An edge (vi , f ' ) in G is called an M -upper edge if it connects two M -upper nodes; all other edges 
are called M -lower edges. 

In other words, a node is called Af -upper if it is covered by M, and called M-lower otherwise. 
(The terminology simply reflects the fact that the so-called M-upper nodes appear in the upper part 



of the graph.) Consider, for example, the graph in the left of Figure |2] In that graph ni = n2 = 4, 
t = 3, and M = {{vi,v[), (f2, 1^2)' (''^Si ''^s)}' the graph has six M-upper nodes and two Af -lower 
ones. The edges {v3,v'^) and (174, v[) are M-lower edges, while all other five edges are Af -upper. 





Figure 2. A bipartite graph G and the corresponding restriction Gu 

For simplicity, we shall use hereinafter the terms upper and lower without specifying the prefix 
M. It should always be understood that a node or an edge aie upper or lower with respect to the 
given M. 

Our first observation towards a classification of all allowed edges in such a graph is as follows. 

Proposition 2.4. Let G = {V, E) be a bipartite graph as described above. Then: 

1. The graph has no lower edges that connect two lower nodes. 

2. All lower edges are allowed. 

Proof. The first claim is obvious since if there was a lower edge {vi,v'A where i,j > t, then 
M U {{vi,v'A} would have been a matching, thus contradicting the assumed maximality of M. As 
for the second claim, let {vi , f ' ) be a lower edge with i > t. In view of the first claim, v'- must 

be an upper node. Therefore, (M\j{{vi,Vj)}j \ {{vj,Vj)} is also a maximum matching, whence 
{vi , u' ) is an allowed edge. D 



As Proposition 12.41 determines that all lower edges are allowed, it remains to identify the al- 
lowed edges among the upper edges. Let Gu denote the restriction of G to the upper nodes, 
{vi,Vi, . . . , vt,Vf}. We distinguish between two types of allowed edges among the upper edges: 

Definition 2.5. An upper edge is an allowed edge of type I (with respect to M) if it is included in a 
maximum matching that consists only of upper edges. An upper edge is an allowed edge of type II 
(with respect to M) if it is allowed, but all maximum matchings that include it include also a lower 
edge. 

For example, consider the graph G on the left of Figure |2] and the corresponding Gu on the right. 
The edge ( (73 ,v'^)h not allowed, but all other six edges are allowed: 

1. The edges (^3, v'^ and (774, v'^) are allowed since they are lower edges. (For example, (^3, t;^) 
may be extended to a maximum matching with {vi,v'^) and (t;2, ^2)-) 

2. The edges {vi,v'i), (^2,^2). and (^3,^3) ai^e allowed edges of type I since they are allowed 
edges also in Gu- 



3. The edge (^2,^3) is an allowed edge of type II since it is allowed (together with {vi,v[) and 
{v3,v'^) it forms a maximum matching), but it is not allowed in Gu- 

All allowed edges of type I can be identified by applying Algorithmic on the subgraph Gu (since 
Gu is a balanced graph with a perfect matching). It remains only to identify the allowed edges of 
type II. 

Definition 2.6. A set P of i — I > 1 upper edges in G is called an upper alternating path (with 
respect to M) if there exist i distinct indices, 1 < ii, ■ ■ ■ ,ie <t, such that 

P = {{vii.v'i^), (ui2, u-g), . . . , {vi^^^^v'i^)} . (1) 

Ifio>t and A = (vjo , f ^ ) is a lower edge in E then {\]VJP is called a left-augmented alternating 
path. Ifie^i > t and p = {vi^ , v'^ ) is a lower edge in E then P U {p} is called a right-augmented 
alternating path. Finally, we let AV denote the set consisting of all alternating paths of all three 
sorts — upper, left- and right-augmented. 

For example, P = {(112, ^3)} is an upper alternating path of length 1 in the graph G of Figured 
while P U {(^13, W4)} is a corresponding right-augmentation. 

Theorem 2.7. Any allowed edge of type II with respect to M must be contained in an either left- 
or right-augmented alternating path in G with respect to M. On the other hand, an edge that is 
contained in an either left- or right-augmented alternating path is allowed. 

Proof. Let e = {vi, v[, ) be an upper edge which is an allowed edge of type II with respect to M. 
Let Me be a maximum matching that includes e. Define 

AVe = {P e AV : P Q Me and e G P} ; 

namely, AVe consists of all alternating paths (upper, left- and right-augmented ones) that are con- 
tained in Me and include e. Clearly, AVe is nonempty since it includes the upper alternating path 
of length 1 that consists only of e. Hence, we may select a path P € AVe of maximal length among 
all paths in AVe. We claim, and prove below, that P cannot be an upper alternating path. Hence, P 
must be either left- or right-augmented alternating path. Since P includes e, that will settle the nec- 
essary condition in the theorem: Any allowed edge of type II with respect to M must be contained 
in an either left- or right-augmented alternating path with respect to M. 

Assume that P is an upper alternating path. Denoting its length hy i — 1, P must take the form 



Vi 



«<?■ 



where 1 < ii,. . . ,i£ < t. We claim that the matching Me does not match neither v[ nor 
Assume, towards contradiction, that Me includes an edge {vj, v[ ). Then j cannot be any of the 
indices outside the set {ii, . . . ,ii}, as that would contradict the assumed maximality of P. In 
addition, j cannot be any of the indices in {ii , . . . , i(,-i} since then Me would include two edges that 
are adjacent to Vj. Finally, j cannot be ii since then e would have been contained in an alternating 
cycle in Gu (Definition 12.11 ) and then, by Theorem 12.21 e would have been allowed in G„, thus 
contradicting our assumption that e is an allowed edge of type II. Therefore, Me does not match v[ . 
For the same reasons. Me cannot match vi^. But then, consider the set of edges 

M^ = (Me\P)U{(7;i,,<,):l<A;<n. 



In view of the above discussion, Mg is a matching (since Mg \ P does not cover any of the nodes Vi^, 
and v',i ,1 < k < t). But as |Mg| = |Me| + 1, it contradicts our assumption that Mg is a maximum 
matching. 

Next, we turn to prove the sufficiency of the condition; namely, that an edge that is contained in 
an either left- or right-augmented alternating path with respect to M must be allowed in G. Let 

P = {Ko> ^U' Ki ' ^U' (^^2 > -^U' • • • ' K-i ' ^U) 
be a left-augmented path in G (namely, iq > t and ik <t for all 1 < fc < i). Then 

Mo = P U {(t;,,^;.) : j G {1, . . . ,n \ {n, • ■ • ,^4} (2) 

is a matching of cardinality t. Hence, all edges in P are included in a maximum matching, whence 
they are all allowed. (One of those allowed edges, namely {vif^,v[_^), is a lower edge, while all the 
other upper edges may be allowed edges of either type I or II). The proof for right-augmented paths 
is similar-. D 

Hence, in order to identify all allowed edges of type II with respect to M, we have to scan all 
left- and right-augmented alternating paths and mark all upper edges along them as being allowed 
(some of those allowed edges may be of type I, whence they will be "discovered" twice). To do 
that, we use again the representation of G as a directed graph H = {U, F). 

Definition 2.8. Let G = {V, E) be a bipartite graph where the two parts of the graph are Vi = 

{vi, . . . ,fni} and V2 = {v[, . . . , v'^^}, and ni < n2. The corresponding left-to-right directed 
graph is Hm = {U, Flr) where U = {ui, . . . , Un^} and {ui,Uj) S Flr if and only if{vi, v'A € E. 
The right-to-left directed graph is Hfji = {U, Frl) where U = {ui, . . . ,Un^} and {ui,Uj) G Ffn 
if and only if{vj,v[) € E. 

With those definitions, an edge {vi,v'A is on a left-augmented (resp. right-augmented) alternating 
path if and only if the edge {ui,Uj) is reachable in Hlr (resp. Hrl) from one of the nodes u^, 
where t -\- 1 < k < ri2. Hence, all that is needed is to perform a BFS of Hiji (resp. Hrl) from 
those nodes and for every edge (uj , Uj ) that we visit along this scan, mark the coiTcsponding edge 
{vi,v'j) (resp. {vj,v^)) in G as an allowed edge. The runtime of such a scan is linear. Algorithm[3] 
summarizes the procedure of finding all allowed edges in a general bipartite graph. 

3 The dynamic setting 

Going back to the opening example of the matchmaking agency, if one of the allowed edges 
materializes and another couple of clients leaves happily the matching game, the agency has to 
update the set of allowed edges in the reduced graph. Here we comment on how to cope with such 
dynamic updates efficiently. Before doing so, we describe another interesting example. 

A domino board is a bounded region D in the Eucledean plane which is the union of unit squares 
{Si J := [i, i + 1] X [j,j + 1] for some i,j G N) that are connected in the sense that there exists a 
path between the centers of every two unit squares that is fully contained within the interior of D. 
Each unit square Sij is colored white in case i + j is even, or black otherwise. A perfect tiling of 
the region D is a cover of D by \D\/2 non-overlapping dominos {\D\ being the area of D), where 
dominos are shapes formed by the union of two unit squai^es meeting edge-to-edge. 

Now, let us consider the following computerized game: A domino region that has a perfect tiling 
is presented to the player, who needs to tile it by placing dominos on it, one at a time. If he places 



Algorithm 3 Finding all allowed edges in a general bipartite graph, given a maximum matching. 
Input: A bipartite graph G = {V, E) where V = V1UV2, Vi = {vi,..., t;„J, V2 = {v[, . . . , <J, 

i? C Vi X V2, and {{vi,v[) : 1 < i < t} is a given maximum matching in G. 
Output: All allowed edges in E. 
1: Mark all lower edges as allowed. 
2: Mark all upper edges as not allowed. 
3: Apply Algorithm [2] on the restriction Gu of G to {fi,f'^, . . . ,vt.,v[}, thus marking all upper 

edges that are allowed in Gu- 
4: Construct the left-to-right directed graph Hlr. 

5: Add to Hlr a new source node uq and connect it to each of the nodes u^, where t+1 < k < ni. 
6: Apply a BFS on the graph Hlji starting from uq. For each edge {ui,Uj) that the BFS visits, 

mark the corresponding edge {vi , f ' ) in G as allowed. 

Construct the right-to-left directed graph Hrl. 

Add to Hrl a new source node uq and connect it to each of the nodes u^, where t+1 < k < n2. 

Apply a BFS on the graph Hrl starting from uq. For each edge (uj, Uj) that the BFS visits, 

mark the corresponding edge {vj, f^) in G as allowed. 



a domino in a location that prevents any completion of the tiling, the computer issues an alert and 
then the player has a chance to try again. The player wins if he was able to complete the tiling 
successfully with a number of bad moves below some predefined threshold. 

Here too, there is an underlying bipartite graph, where each node represents a white or a black 
square in D. Each square is connected to any of its neighboring squares of the opposite color. A 
perfect tiling of Z? is a perfect matching in that graph. The bad moves correspond to edges in that 
graph that are not allowed. In each step in the game, the player places one domino; it is then needed 
to remove the corresponding nodes and adjacent edges from the underlying graph and update the 
list of allowed edges in the reduced graph. 

Following the two motivating examples, we now formulate the problem in the dynamic setting. 
Let G be a bipartite graph as discussed in Section 12.21 Assume that we already found a maximum 
matching in G, say M := {{vi,v[), . . . , (vtjv'-i.)}, and identified all allowed edges in G. In addition, 
we may assume that each allowed edge is marked by its type: In balanced graphs with a perfect 
matching all allowed edges are of the same type, but in other graphs we distinguish between allowed 
edges that are lower edges, and allowed edges that are upper edges of type I or of type II. Let {vi , v', ) 
be an allowed edge in G and let G' be the graph that is obtained from G by removing Vi, v',, and 
all adjacent edges. We wish to find a maximum matching in G' in an efficient manner, in order 
to proceed and identify in linear time all allowed edges in G' . We separate the discussion to three 
cases: 

• If {vi , v'- ) is an allowed edge of type I, we look in the directed graph Hlr for a path from Uj 
to Uj. Assume that the path is 

[Uj , Ujj ) , [Uj-^ , Uj2 j , . . . , \Uj^_^ , Uj^ ) , [Uj^ ,Ui)- 

Then it is easy to see that 

M \ {ivk,v'k) ■■ k G {i,j,ji,. . . ,ji}} U {{vj,v'jj, {vj,,v'j^), ..., ivj^_^,Vj^), (wj,,w-)} 
is a maximum matching in G'. 



• If {vi,v'j) is an allowed edge of type II, then Theorem 12.71 implies that either the edge {ui,Uj) 
is contained in a path in Hlji that ends with a lower node, or the edge {uj,Ui) is contained in 
a path in Hrl that ends with a lower node. Hence, we may apply a BFS on Hlr starting in Ui 
in order to look for a path that connects it to a lower node; if none is found, a similar- BFS on 
Hrl, starting in Uj, is guaranteed to find a path that reaches a lower node. Once such a path 
is found, we may reconstruct the corresponding maximum matching Mq in G that contains 
the edge {vi,v'A (see Eq. Q). Clearly, Mq \ {{vi,v'-)} is a maximum matching in G'. 

• Finally, if {vi,v'A is a lower edge then M \ {{vi,v^)} is a maximum matching in G' in case 
i < t, or M \ {{vj,v'j)} is, in case j < t. 

4 Related work 

Costa Q presented an algorithm for decomposing the edge set of a bipartite graph G = {V,E) 
into three disjoint partitions, E = EiU E^ U £"0, where Ei contains all allowed edges that belong 
to all maximum matchings in G, E.^ contains all other allowed edges, and Eq consists of the non- 
allowed edges. The runtime of her algorithm is 0{nni). Carvalho and Cheriyan f5|| designed an 
algorithm with the same time complexity for finding all allowed edges in general graphs, using 
well-known results on efficient implementations of Edmonds' maximum-matching algorithm and 
other results from the matching folklore. 

Rabin and Vazirani (TPl designed a simple randomized algorithm for finding all allowed edges in 
perfectly-matchable general graphs. We outline their algorithm: Let G = {V, E) be the input graph. 
Its Tutte matrix is defined as the following n x n matrix, 

{Xij {vi,Vj) e E , i> j 
-Xij {vi,Vj) e E , i <j , 
otherwise 

where Xij are indeterminates. Let p > n^ be any prime, S{A) be a random substitition of all 
Xij with elements from GF{p), and B = S{A)~^. Then with probability at least e~^'" , the 
matrix B identifies the set of allowed edges, in the sense that {vi,Vj) is an allowed edge if and only 
if Bij / 0. The runtime of this algorithm is determined by matrix inversion time and it equals 
Q^j^2.376^ Cheriyan |2J designed a similar algorithm with the same runtime that applies for any 
general graphs (i.e., not necessarily ones with a perfect matching). 

Our algorithm is deterministic. Its runtime is determined by the time to find a single maximum 
matching, which is 0{n^/'^m) (Hopcroft and Karp [10|) or 0((n/logn)^/^?n,) for dense graphs 
with m = ©(?i^) (Alt et al. |[ll). Hence, it improves upon that of HIH by a factor of n^^^ (or 
(jilogn)^/^ for dense graphs, where m = 0(n^)). In comparison to the randomized algorithms, 
our algorithm is faster when m = 0{n^) and r < 1.876, since then its runtime is 0{n^^^'^), 
where r + 1/2 < 2.376. In addition, our algorithm is elementary, conceptually simple, and easy to 
implement. Another advantage of our algorithm in comparison to the above mentioned algorithms 
is that in cases where there is one "natural" maximum matching, which is known without invoking 
a costly procedure for finding one, the runtime of our algorithm is only 0{n + m). The runtime 
of all of the above mentioned algorithms remains the same even in such cases. (Examples of cases 
where one maximum matching is known upfront are described in Section |5l) 

Identifying all allowed edges is equivalent to computing the union of all maximum matchings 
in the graph. Another line of research concentrated on algorithms for enumerating all maximum 
matchings in bipartite graphs, e.g. lIUlTSlO. Naturally, the runtime of such algorithms depends 



on the number of the counted matchings (perfect matchings in E O and maximal, maximum, or 
perfect matchings in fT3\). We recall in this context that counting the number of perfect matchings 
in a bipartite graph is equivalent to computing the permanent of a {0, l}-matrix, a problem that is 
known to be in #P-complete JTSl . 

5 Epilogue 

This study was motivated by a problem of providing anonymity in databases 171. Consider a data 
owner that has a database of records that hold information about individuals in some population. 
He needs to publish the database for the purpose of analysis and public scrutiny. In order to thwart 
adversarial attempts to identify the individuals behind some of those records, he generalizes the data 
in those records so that each generalized record would be consistent with the personal information 
of more than one individual. In order to ensure a minimal level of anonymity, the data owner wishes 
to guarantee that each generalized record would be consistent with the personal information of no 
less than k individuals, for some given threshold k. As the adversary knows that there is a one- 
to-one mapping between the population of individuals and the records in the generalized database, 
this setting may be modeled by a bipartite graph that has a perfect matching. Hence, the adversary 
knows that all edges in the bipartite graph that are not allowed can be ignored, as they cannot stand 
for a true linkage between an individual and his record in the table. Therefore, the data owner 
needs to make sure that even after the removal of such non-allowed edges from the bipartite graph, 
the anonymity constraint is still respected. Since the data owner knows the true perfect matching 
(because he has the raw data and he performs on it the generalization), he may use Algorithm |2] to 
identify all non-allowed edges efficiently and then verify that the generalized version of the database 
that he created provides the required level of anonymity. 

Another example from the realm of privacy and anonymity is the work of Yao et al. Ll61. In that 
work, they consider the case in which multiple views of the same database need to be released and it 
is needed to check whether their release would violate some privacy constraint (/c-anonymity). The 
algorithm which they suggest needs to identify all edges in a bipartite graph that are part of some 
complete matching. (A matching in a bipartite graph is called complete if it covers all nodes in 
the possibly smaller part of the graph.) They suggest to do it using a naive approach: Testing each 
edge in the graph whether its removal leaves a graph that still has a complete matching. Such an 
approach entails invoking the Hopcroft-Karp algorithm for each edge, whence its time complexity 
is 0(n^' ^771^). However, in the case that was studied there, the data owner knows one complete 
matching in the underlying bipartite graph. Whence, he may use Algorithm |3]in order to perform the 
same task in 0(n + m). The improvement factor in runtime is 0{n^'^m). As in such applications 
the graphs tend to be very large, such an improvement offers a practical solution instead of an 
impractical one. 

An interesting research problem that this study suggests is to extend this approach to the general 
case, namely to non-bipartite graphs. The question is whether a similar algorithm that uses one 
known maximum matching can be devised in order to find all allowed edges efficiently. 
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A Extending a set of non-adjacent edges to a matching 

The discussion in Section |2] implies that any set of k non-adjacent edges in a bipartite graph may 
be extended to a matching of size at least t — k, where t is the size of a maximum matching in G. 

Theorem A.l. Let G = {V, E) be a bipartite graph where the two parts of the graph are Vi = 

{vi, . . . , Vni} and V2 = {v[, . . . , v'^^}, ni < 722, and the maximum matchings are of size t < ni. 
Let P be a set of k non-adjacent edges in E. Then there exists a matching of size t — k in G that 
includes P. More specifically, assume that P = {{vj^-jv'^.) : 1 <i <k} where A = {ji, . . . , Jfc} C 
[ni\ and A' = {£1, . . . ,£fc} C [712] are the corresponding subsets of k distinct indices. Then there 
exists a matching of size k + t — \A[J A' \ in G that includes P. 

Proof. As before, we may assume, without loss of generality, that one of the maximum matchings 
in Gis M := {{vi^v'i), . . . , {vt,v[)}. Let B = [t] \{AVJA') denote the set of indices of edges from 
M that are not adjacent to any of the edges in P. The size of B is at least t — \Ayj A'\. Clearly, 
P U {{vi^v[) : i € B} is a matching that includes P. Its size is A; + |i?| > k + t — \AU A'\. Since 
|A U A'l < \A\ + l^'l = 2k, we infer that P is included in a matching of size at least t — k. D 

It is easy to see that the lower bound in Theorem lA.il is sharp. To exemplify that, consider the 
case where M is a perfect matching consisting of t edges, the set of k non-adjacent edges P is such 
that 1^ U ^'1 = 2k, and E = M U P. The only way to extend P into a matching is by adding 
to it edges from M. The maximal number of edges from M that could be added to P is precisely 
i — 1^ U A' I in this case. 

B All edges in a regular bipartite graph are allowed 

As noted earlier, the problem of finding a maximum matching in a d-regular bipartite graph is 
easier. The maximum matchings in such graphs are perfect. A randomized algorithm due to Goel 
et al. m finds them in time 0{n log n). However, the problem of finding all allowed edges in such 
graphs is trivial, since the set of edges in such graphs may be partitioned into a disjoint union of d 
perfect matchings, and, consequently, all edges in such graphs are allowed. 

Theorem B.l. All edges in a regular bipartite graph are allowed. 

Proof. In regular bipartite graphs, G = {V, E), where V = ViVJ V2, all nodes have the same degree 
d. Hence, such graphs ai^e balanced, |Vi| = \V2\. Let X be any subset of Vi and let N{X) denote 
the set of all its neighbors in V2. Clearly, \N{X) | > |X| since otherwise at least one of the nodes in 
N{X) would have to be of degree higher than d. Hence, Hall's maniage theorem 13 implies that G 
has a perfect matching. Let Afi be such a matching and consider the graph Gi = {V,E\ Mi). This 
graph is {d — 1) -regular, whence it too has a perfect matching, say M2. Continuing in that manner 
we may construct d disjoint perfect matchings. Mi, . . . , M^, such that E = IJi<j<d -^«- ^^ follows 
that all edges in E are allowed. D 
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